Amyloid-β prediction machine learning model using source-based morphometry across neurocognitive disorders

Previous studies have developed and explored magnetic resonance imaging (MRI)-based machine learning models for predicting Alzheimer’s disease (AD). However, limited research has focused on models incorporating diverse patient populations. This study aimed to build a clinically useful prediction model for amyloid-beta (Aβ) deposition using source-based morphometry, a data-driven algorithm based on independent component analysis. Additionally, we assessed how predictive accuracy varied with the feature combinations. Data from 118 participants, comprising healthy controls and patients clinically diagnosed with conditions such as AD, mild cognitive impairment, frontotemporal lobar degeneration, corticobasal syndrome, progressive supranuclear palsy, and psychiatric disorders, were used to develop the model. We used structural MR images, cognitive test results, and apolipoprotein E status for feature selection. Three-dimensional T1-weighted images were preprocessed into voxel-based gray matter images and then subjected to source-based morphometry. We used a support vector machine as a classifier. We applied SHapley Additive exPlanations, a game-theoretical approach, to ensure model accountability. The final model, which was based on MR images, cognitive test results, and apolipoprotein E status, yielded 89.8% accuracy and an area under the receiver operating characteristic curve of 0.888. The model based on MR images alone showed 84.7% accuracy. Aβ-positivity was correctly detected in non-AD patients. One of the seven independent components derived from source-based morphometry was considered to represent an AD-related gray matter volume pattern and showed the strongest impact on the model output. Aβ-positivity across neurological and psychiatric disorders was predicted with moderate-to-high accuracy and was associated with a probable AD-related gray matter volume pattern. An MRI-based data-driven machine learning approach can be beneficial as a diagnostic aid.


Standard protocol approval, registration, and patient consent
The Certified Review Board of Keio University approved the study design and protocol. The study was registered with the University Hospital Medical Information Network Clinical Trials Registry (UMIN-CTR; https://www.umin.ac.jp/ctr/index.htm, ID# UMIN000032027, the first registration: 31/03/2018) and the Japan Registry of Clinical Trials (jRCT; https://jrct.niph.go.jp/, ID# jRCTs031180225, the first registration: 11/03/2019), and was conducted in accordance with the 1964 Declaration of Helsinki and its later amendments. All participants and their proxies, if necessary, provided written informed consent.

Apolipoprotein E (APOE) genotyping
Genomic DNA was extracted from 0.2 mL whole blood using a Magnetic Nanoparticle DNA Extraction Kit (EZ1 DNA Blood 200 μL Kit). APOE genotyping (rs429358 and rs7412) was performed by real-time polymerase chain reaction (PCR) using the TaqMan probe on a CFX 96 deep well Real-Time PCR system (Bio Rad, Richmond, CA) to analyze the three major isoforms (APOE ε2, ε3, and ε4).

[18F] Florbetaben (FBB) amyloid-PET imaging
[18F] FBB was manufactured on-site using an automated synthesizer as described elsewhere 45,46. Amyloid-PET images were acquired for 20 min using a PET-CT (True Point Biograph 40/64, Siemens Japan K.K., Tokyo, Japan), 90 min after intravenous injection of 300 MBq ± 20% [18F] FBB. The 20-min PET images were visually assessed by nuclear medicine experts who had completed a training program offered by the manufacturer (Piramal Imaging GmbH, Berlin, Germany). The Aβ positivity/negativity was determined based on the assessment of tracer uptake in the gray matter (GM) of the following four brain regions: the lateral temporal lobes, frontal lobes, posterior cingulate cortex/precuneus, and parietal lobes, in line with the NeuraCeq™ guidelines (http://www.accessdata.fda.gov/drugsatfda_docs/label/2014/204677s000lbl.pdf) 47. Aβ negativity was established when tracer uptake (i.e., signal intensity) in the GM was lower than that in the white matter (WM) in all four brain regions.
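The visual read rule above can be summarized as a small decision function. This is only a sketch: the region keys and the boolean encoding of each expert judgment ("GM uptake lower than WM uptake") are illustrative assumptions, and the actual assessment was made visually by trained readers, not by software.

```python
# Hypothetical keys for the four regions named in the text.
REGIONS = (
    "lateral_temporal",
    "frontal",
    "posterior_cingulate_precuneus",
    "parietal",
)


def is_amyloid_negative(gm_lower_than_wm):
    """Return True only if GM uptake is lower than WM uptake in all four regions."""
    return all(gm_lower_than_wm[region] for region in REGIONS)


# A scan in which every region shows lower GM than WM uptake is read as negative.
reads = {region: True for region in REGIONS}
negative_scan = is_amyloid_negative(reads)

# A single region failing the criterion means the scan is no longer read as negative.
reads["parietal"] = False
still_negative = is_amyloid_negative(reads)
```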

MRI pre-processing
Structural brain images were first segmented into GM, WM, and cerebrospinal fluid (CSF) using the Statistical Parametric Mapping (SPM12; Wellcome Trust Center for Neuroimaging, London, UK) toolbox CAT12 (http://www.neuro.uni-jena.de/cat/) in MATLAB (R2019a; MathWorks, Natick, Mass, USA). Segmented GM images were used to normalize the individual component images to the Montreal Neurological Institute (MNI) template 48. Normalized images were modulated to preserve the total amount of signal from each voxel, resampled to an isotropic voxel size of 2 × 2 × 2 mm³, and smoothed using a 5-mm full-width-at-half-maximum Gaussian kernel.
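For readers reproducing the smoothing step with tools that parameterize a Gaussian kernel by its standard deviation rather than its full width at half maximum (FWHM), the conversion can be sketched as follows. The identity FWHM = 2·sqrt(2·ln 2)·σ is standard; the variable names are illustrative, and nothing here reflects the internals of SPM12 or CAT12.

```python
import math

FWHM_MM = 5.0   # smoothing kernel width stated in the text
VOXEL_MM = 2.0  # isotropic voxel size stated in the text

# Standard Gaussian identity: FWHM = 2 * sqrt(2 * ln 2) * sigma
sigma_mm = FWHM_MM / (2.0 * math.sqrt(2.0 * math.log(2.0)))

# The same kernel width expressed in voxel units.
sigma_vox = sigma_mm / VOXEL_MM

print(f"sigma = {sigma_mm:.3f} mm = {sigma_vox:.3f} voxels")
```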
For the subsequent pre-processing, we used source-based morphometry (SBM) 30,35. SBM incorporates independent component analysis (ICA) and provides automatic decomposition of a given set of anatomical brain images into independent spatial maps characterizing different modes of anatomical variability across all individuals 30,35.
The preprocessed GM images were loaded with Nibabel (https://nipy.org), and a three-dimensional (3D) array of 91 × 109 × 91 voxels was transformed into a one-dimensional (1D) array of 1 × 902,629 voxels. We created a brain mask for this 1D array using the Neuromorphometric Atlas (provided by Neuromorphometrics, Inc. (http://Neuromorphometrics.com)) 49 and selected 208,082 voxels on which ICA was performed for all scans using the FastICA function of scikit-learn (https://scikit-learn.org/stable/), a Python machine learning library. The number of extracted independent components (ICs) was also used as a definitive hyperparameter to be tuned in subsequent model building.
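The flatten, mask, and decompose sequence described above can be sketched on toy data as follows. The array sizes, the random mask, and the number of components are illustrative assumptions, and treating voxels as samples (so that the recovered sources are spatial maps) is an assumed orientation that the text does not specify.

```python
import numpy as np
from sklearn.decomposition import FastICA

rng = np.random.default_rng(0)

# Toy stand-ins: 20 "participants", each a tiny 6 x 7 x 6 volume
# (the study used 91 x 109 x 91 volumes; these sizes are illustrative).
n_subjects, shape = 20, (6, 7, 6)
volumes = rng.random((n_subjects, *shape))

# Flatten each 3D volume into a 1D row vector.
flat = volumes.reshape(n_subjects, -1)

# Hypothetical brain mask: keep an arbitrary subset of voxels.
mask = rng.random(flat.shape[1]) > 0.3
masked = flat[:, mask]

# Spatial ICA (assumed orientation): voxels are treated as samples, so the
# sources returned by FastICA are spatial maps, one column per component.
n_components = 3  # illustrative; treated as a tunable hyperparameter in the text
ica = FastICA(n_components=n_components, random_state=0, max_iter=1000)
spatial_maps = ica.fit_transform(masked.T)

# Put one component back onto the full 3D grid, e.g. for visualization.
ic_volume = np.zeros(flat.shape[1])
ic_volume[mask] = spatial_maps[:, 0]
ic_volume = ic_volume.reshape(shape)
```

On real data the first reshape would produce the 1 × 902,629 vector mentioned above, since 91 × 109 × 91 = 902,629.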
After conducting the ICA, we reshaped the data matrix (i.e., ICs) back into a 3D image (91 × 109 × 91) using nipy (https://nipy.org). The 3D image was then superimposed onto the MNI-normalized template brain using BrainNet Viewer 50, for visualization. The extracted ICs were used as spatial regressors for each participant's GM images (I_GM).
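A spatial regression of this kind is conventionally written as a linear model. The form below is a sketch consistent with the symbols used in the text (I_GM, β, K); the subscript notation and the residual term ε are assumptions.

```latex
I_{GM} = \beta_1 \, IC_1 + \beta_2 \, IC_2 + \cdots + \beta_K \, IC_K + \varepsilon
```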
In the above formula, each β represents the weighting coefficient associated with the effect of each IC for the GM image and K indicates the number of extracted ICs. Accordingly, the β-values could be loosely regarded as "weighted total gray matter volume" of the brain parcel represented by the given IC 51. The β-values were then used as representative GM measures associated with each component, in the subsequent analyses.
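As an illustration of how such β-values can be obtained, the sketch below regresses one synthetic GM vector on K synthetic spatial maps. Ordinary least squares is an assumed estimator (the text does not name one), and all data and weights are made up for the example.

```python
import numpy as np

rng = np.random.default_rng(0)

n_voxels, K = 500, 3

# Columns are toy spatial maps standing in for IC_1 ... IC_K.
ics = rng.standard_normal((n_voxels, K))

# Hypothetical weights for one participant, plus a small amount of noise.
true_beta = np.array([2.0, -1.0, 0.5])
gm_image = ics @ true_beta + 0.01 * rng.standard_normal(n_voxels)

# Ordinary least squares: find beta minimizing ||gm_image - ics @ beta||^2.
beta_hat, *_ = np.linalg.lstsq(ics, gm_image, rcond=None)

print(np.round(beta_hat, 3))
```

Repeating the fit for every participant would give one row of K β-values per person, which is the form in which the components enter the classifier as features.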

Machine learning
We built predictive models for Aβ-positivity using scikit-learn (https://scikit-learn.org/stable/index.html) 52, which is supported by Python ver. 3.4. The input feature values were based on the ICA's β-values, demographic characteristics (i.e., age and sex), cognitive assessments, and APOE genotype. First, we used all input features and built the final model. Second, we investigated the model performance for each combination of features (e.g., brain images alone, brain images and cognitive assessments). Third, we investigated model performance for each combination of diagnoses (e.g., AD + HC and AD + MCI + HC).
Throughout the model building, we used a Gaussian kernel support vector machine (SVM) as the classifier, and the model was validated using fivefold cross-validation (Additional Fig. 1). For each of the five training/test splits, the model was fitted to the training data, and the predictive value was assessed using the test data. We tuned the hyperparameters (i.e., Gamma and C in SVM and the number of ICs) with a grid search for every model.
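A minimal sketch of a Gaussian-kernel SVM tuned by grid search under fivefold cross-validation is given below, using synthetic data. The feature scaling step, the stratified and shuffled folds, and the grid values are all assumptions made for illustration; the values searched in the study are not given in the text.

```python
from sklearn.datasets import make_classification
from sklearn.model_selection import GridSearchCV, StratifiedKFold
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC

# Synthetic stand-in for the feature table (118 rows mirrors the cohort size only).
X, y = make_classification(
    n_samples=118, n_features=10, n_informative=4, random_state=0
)

# kernel="rbf" is scikit-learn's name for the Gaussian kernel.
pipeline = make_pipeline(StandardScaler(), SVC(kernel="rbf"))

# Illustrative grid over C and gamma.
param_grid = {
    "svc__C": [0.1, 1, 10, 100],
    "svc__gamma": [0.001, 0.01, 0.1, 1],
}

cv = StratifiedKFold(n_splits=5, shuffle=True, random_state=0)
search = GridSearchCV(pipeline, param_grid, cv=cv, scoring="accuracy")
search.fit(X, y)

print(search.best_params_, round(search.best_score_, 3))
```

Tuning the number of ICs as well would require rerunning the ICA and the spatial regression for each candidate value before fitting the classifier, which is omitted here.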
To improve the interpretability of the model, we applied SHapley Additive exPlanations (SHAP) (https://shap.readthedocs.io/en/latest/index.html), which explains the output of any machine learning model by means of an explanation model 53. SHAP is based on the Shapley value from game theory; a feature with a large absolute SHAP value has a strong influence on the prediction. In the present study, the clinical features with positive and negative SHAP values were associated with Aβ-positivity and Aβ-negativity, respectively.
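To make the game-theoretic idea concrete, the sketch below computes exact Shapley values for a toy three-feature scoring function by enumerating all feature coalitions. The scoring function, the input, and the all-zero baseline are hypothetical; for real models the SHAP package estimates such values rather than enumerating coalitions, and the estimator used in the study is not stated in the text.

```python
from itertools import combinations
from math import factorial


def model(x):
    """Toy scoring function with an interaction term (purely illustrative)."""
    return 2.0 * x[0] - 1.0 * x[1] + 0.5 * x[0] * x[2]


def shapley_values(f, x, baseline):
    """Exact Shapley values; features outside a coalition take baseline values."""
    n = len(x)

    def value(coalition):
        z = [x[i] if i in coalition else baseline[i] for i in range(n)]
        return f(z)

    phi = []
    for i in range(n):
        others = [j for j in range(n) if j != i]
        total = 0.0
        for size in range(n):
            for subset in combinations(others, size):
                # Weight of a coalition of this size in the Shapley formula.
                weight = factorial(size) * factorial(n - size - 1) / factorial(n)
                with_i = value(set(subset) | {i})
                without_i = value(set(subset))
                total += weight * (with_i - without_i)
        phi.append(total)
    return phi


x = [1.0, 2.0, 3.0]
baseline = [0.0, 0.0, 0.0]
phi = shapley_values(model, x, baseline)
print(phi)
```

The values sum to the difference between the model output at x and at the baseline, and their signs indicate whether a feature pushes the output up or down, which is how positive and negative SHAP values are read in this study.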

Statistical analysis
For the statistical analyses, we used Scipy (https://www.scipy.org), supported by Python version 3.4. Demographic and clinical variables were compared using a two-tailed t-test or chi-square test, where appropriate. Relationships among features were examined using Pearson's correlation analysis for continuous variables. Analysis of variance (ANOVA) was conducted to determine associations with diagnoses. Statistical significance was defined by a p-value of < 0.01 or < 0.05 after the Bonferroni correction for multiple comparisons.

... Aβ-negative (Table 1). The demographic and clinical characteristics are shown in Table 1.
Table 3 shows the performance of the final model in predicting Aβ positivity in each diagnosis. The final model achieved an accuracy of 89.8% when including all the participants. The accuracy of the model based on AD, MCI, and HC was slightly lower (i.e., 88.4%), whereas that based solely on MCI was the lowest (i.e., 75.9%). Notably, Aβ-positivity/negativity was identified with 100% accuracy in FTLD syndromes and in psychiatric disorders.

SBM
Seven ICs (IC 1-7) were derived from the final SBM model (Table 4 and Additional Fig. 2). Each component showed spatially maximally independent GM volume patterns. Upon examining the relationship between each component and clinical information, IC 1 showed a significant correlation with cognitive measures and Aβ-positivity. Meanwhile, IC 4 was significantly correlated with age (Table 4).
We assessed whether each clinical diagnosis was associated with the ICs. Only AD-diagnosis and IC 1 showed a significant association (Games-Howell test was applied for multiple comparisons, p < 0.001), whereas the other diagnoses were not associated with any ICs. The GM volume pattern of IC 1 is shown in Fig. 2. The spatial pattern of the loading coefficients from IC 1 showed higher z-scores in the lateral parietal lobes than in the other ICs.

Feature importance of the model
The SHAP values were calculated (Fig. 3), in which IC 1 showed the strongest impact on the model, followed by Logical Memory I and II, IC 3, and APOE x/4.

Discussion
Using SBM, our machine learning model predicted Aβ-positivity with an accuracy of 89.8% and an area under the curve (AUC) of 0.888 based on brain MRI, cognitive, and genetic data from 118 participants. It also correctly predicted Aβ-positivity/negativity in non-AD participants, such as those with FTLD syndrome and psychiatric disorders. Even a model based solely on brain images achieved 84.7% accuracy and an AUC of 0.830. Among all the covariates in the final model, IC 1 had the strongest impact related to Aβ-positivity prediction, followed by Logical Memory I and II. This suggests that our model may be beneficial in clinical settings.

Model performance
Our model yielded the best accuracy (i.e., 89.8%) when it included non-AD cases, whereas the model based only on the AD continuum achieved slightly lower accuracy (i.e., 88.4%). It can be interpreted that the heterogeneity of clinical features among non-AD participants was informative in refining the accuracy of the final model. While numerous machine learning models based on brain images have been developed, most of them have focused on the clinically determined AD continuum 20,24-27, and predicted the clinical diagnoses of AD instead of imaging/pathology-based Aβ deposition 18,28.
As patients visiting physicians' offices would have various neurocognitive disorders beyond the AD continuum 18,26,27, our model, which was based on diverse clinical populations, may be better suited for application in clinical settings. Even our model based only on structural brain images, which yielded an accuracy of 84.7%, may assist clinicians in identifying and screening potential candidates for AD-related clinical trials. These results may be due to the advantages of SBM, namely its ability to detect subtle morphological changes and unknown patterns in brain structures associated with neurodegenerative diseases without relying on existing atlases 30,35. These strengths could be exploited in a patient population with diversified diseases, as in this study.
Our model achieved a predictive accuracy of 75.9% for Aβ-positivity in individuals with MCI. Notably, it surpassed the accuracy of the physicians' clinical diagnosis of AD, which is approximately 70% 3. Furthermore, our model demonstrated predictive accuracy comparable to previous studies that aimed to predict Aβ-positivity 26 or future AD diagnosis in MCI patients using structural MRI 20.
While no definitive treatment is currently available to slow the progression of AD 54, new drugs aimed at disease-modifying therapies are being approved in some countries 55. In the context of the growing availability of disease-modifying drugs for AD, accurate and early diagnosis will become a higher priority 55. Although Aβ deposition is one of the earliest detectable pathological changes in AD 2,6,8,19, its detection by PET or CSF test may be hampered by the need for specialized facilities, length of time required, or some degree of invasiveness or risk 14-16. Since MRI is safe and applicable to a wide population, an MRI-based Aβ prediction model based on a heterogeneous population may be valuable for clinicians.

Feature importance
SHAP analyses indicated that IC 1, LM I, and LM II were important predictive features. The impact of each of these three leading features was at least twice as strong as that of any other feature.
IC 1, the most important feature in our model, was significantly correlated with Aβ-positivity (r = 0.516) and most of the cognitive measures included in the analyses, as shown in Table 4. Furthermore, the spatial pattern of the loading coefficients from IC 1 roughly followed the "cortical pattern" of neurodegeneration in AD that is characterized by cortical atrophy, particularly in the parietal lobe 56, as depicted in Fig. 2. The parietal lobe, including the precuneus, is known to contribute to episodic memory 57-59, which is likely to be impaired in AD 60,61, and is possibly associated with Aβ pathology 62,63. In our study, however, another "typical AD" pattern 56, medial temporal lobe (MTL) atrophy 64, was not observed in any IC. One possibility is that MTL atrophy does not necessarily indicate Aβ pathology, but may be a signal for tau pathology, such as primary age-related tauopathy 65 or coexistent transactive response DNA-binding protein 43 pathology 66. These clinicopathological relationships may explain why IC 1 was of greater importance in the prediction and represented the AD-related GM volume pattern.
The importance of Logical Memory scores indicated that memory impairment, a cardinal symptom of AD 67, is also essential for prediction.
Interestingly, all ICs showed greater importance than demographic and cognitive features, including scores on the Mini-Mental State Examination (MMSE), an assessment scale suitable primarily for screening for dementia. Among the ICs, IC 4 was uniquely extracted as a normal aging GM volume pattern (Additional Fig. 3) and lacked any significant association with cognitive measures or Aβ-positivity (Table 4). The separate associations between IC 1 and Aβ-positivity and between IC 4 and age might indicate that our model discriminates AD-related neurodegeneration from normal aging in brain imaging. These results imply that the pathological process of AD is not necessarily age dependent. In other words, brain atrophy patterns in normal aging processes can be distinguished from those in neurodegenerative diseases 51, even though the deposition of Aβ plaques is likely to increase with age, and several age-related pathologies may be comorbid with AD 69,70.
Overall, the SHAP analyses imply that SBM-derived GM volume patterns and Logical Memory results might be important for predicting Aβ-positivity across diverse neurocognitive disorders.

Limitation
This study has some limitations. First, Aβ-positivity was determined only by amyloid-PET scan, whereas CSF Aβ would be a more sensitive marker, particularly in the pre-clinical stage 9. Second, the number of samples in machine learning is expected to affect accuracy 71; however, our study had a limited number of samples. Therefore, future studies will require larger sample sizes and independent test datasets 72. Third, longitudinal follow-up data might improve model performance, rather than a cross-sectional approach 73.

Conclusions
Our model achieved 89.8% accuracy in predicting Aβ-positivity across a diverse range of neurological and psychiatric disorders. Notably, the SBM revealed a GM volume pattern that had the strongest impact on prediction. Even when using structural brain images alone, the accuracy still reached 84.7%. This MRI-based data-driven machine learning approach may aid clinicians in patient management and early decision-making processes.

Figure 1. The area under the curve (AUC) of the final model and the brain image-alone model. The area under the receiver operating characteristic curve (AUC) of the final model (a) was 0.888 (95% CI 0.854-0.973), and that of the brain image-alone model (b) was 0.830 (95% CI 0.825-0.958).

Figure 2. The gray matter volume pattern of independent component 1 in a three-dimensional brain map derived from source-based morphometry. A three-dimensional brain map of independent component 1. The color bar indicates the z-score. The z-score is calculated as (value - mean) / standard deviation, and regions with z-scores greater than or equal to 1 are color-coded. The 3D image was generated using BrainNet Viewer 1.7 (https://www.nitrc.org/projects/bnv).
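The z-scoring and thresholding described in this caption can be sketched as follows on synthetic loading coefficients. The data are random, and the use of the population standard deviation (NumPy's default, ddof=0) is an assumption, since the caption does not say which variant was used.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy loading coefficients for one independent component.
loadings = rng.standard_normal(1000)

# z-score as defined in the caption: (value - mean) / standard deviation.
z = (loadings - loadings.mean()) / loadings.std()

# Keep only voxels with z >= 1 for display; the rest are masked out as NaN.
displayed = np.where(z >= 1.0, z, np.nan)

print(int(np.sum(~np.isnan(displayed))), "of", z.size, "voxels would be color-coded")
```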

Figure 3. Mean SHAP value in fivefold cross-validation. The horizontal and vertical axes represent the mean SHAP value in fivefold cross-validation and features, respectively. (a) Shows the relationship between each feature and the absolute value of SHAP in the analysis. A large absolute SHAP value indicates a significant influence on the prediction. (b) Shows the SHAP values for each participant. This plot summarizes how the top features in the dataset affect the output of the model in the form of information density. The x position of the dots is based on the SHAP value of the feature, and the dots are stacked along each feature row to indicate density. Positive and negative SHAP values were associated with Aβ-positivity and Aβ-negativity, respectively. The red dots indicate high values for each feature, while the blue dots indicate low values for each feature. If the red dots are in the positive SHAP, then the higher the feature value, the more it contributes to the Aβ-positivity. Likewise, if blue dots are in the positive SHAP, the lower the feature value, the more it contributes to the Aβ-positivity. For example, lower scores on immediate and delayed recall of Logical Memory (i.e., LM I and LM II) were associated with Aβ-positivity. IC independent component, JART Japanese Adult Reading Test, LM Logical Memory, SHAP SHapley Additive exPlanations, TMT-J The Japanese version of Trail Making Test, WF Word Fluency.

Table 1. Demographic and clinical characteristics. Values are expressed as mean ± SD unless otherwise indicated. The between-group differences were examined using the independent sample t-test (a) for continuous variables, and χ² test (b) for categorical variables. AD Alzheimer's disease, ADAS-cog-J the Japanese version of Alzheimer's Disease Assessment Scale-Cognitive subscale, APOE Apolipoprotein E, CBS Corticobasal syndrome, CDR Clinical Dementia Rating, FAQ Functional Activity Questionnaire, FTLD Frontotemporal lobar degeneration, HC Healthy controls, JART Japanese Adult Reading Test, LM I Logical Memory immediate recall, LM II Logical Memory delayed recall, MCI Mild cognitive impairment, MMSE Mini-Mental State Examination, PSP Progressive supranuclear palsy, Psychiatric Psychiatric disorders, SD Standard deviation, TMT-J The Japanese version of Trail Making Test, WF Word Fluency. *p < 0.01.
Scientific Reports | (2024) 14:7633 | https://doi.org/10.1038/s41598-024-58223-3

Table 4. Relation between each independent component and clinical data. A group comparison analysis of variance (ANOVA) was conducted; *p < 0.05 after Bonferroni correction. ADAS-cog-J The Japanese version of Alzheimer's Disease Assessment Scale-Cognitive subscale, Aβ amyloid-β, CDR Clinical Dementia Rating, FAQ Functional Activity Questionnaire, JART Japanese Adult Reading Test, LM I Logical Memory immediate recall, LM II Logical Memory delayed recall, MMSE Mini-Mental State Examination, TMT-J The Japanese version of Trail Making Test, WF Word Fluency.